Copyright (c) 1993 - 2008 Biocceleration Ltd.



OM nucleic - nucleic search, using sw model 



April 10, 2008, 00:54:56 ; Search time 660 Seconds 
(without alignments) 

14410.097 Million cell updates/sec 
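The throughput figure above can be roughly reproduced from other numbers in this report. The sketch below is a back-of-the-envelope check, not the search program's own accounting: it assumes one "cell update" per query base per database residue, and it assumes the nucleic query was compared in both orientations (the factor of 2 is an inference, not something the report states). The function name is hypothetical.

```python
# Figures copied from the report header.
QUERY_LENGTH = 881             # query length in bases (equals the perfect score)
DB_RESIDUES = 5_397_694_045    # "Searched: 9073515 seqs, 5397694045 residues"
SEARCH_SECONDS = 660           # "Search time 660 Seconds" (printed value, rounded)
STRANDS = 2                    # assumption: forward and complement both searched


def million_cell_updates_per_sec(query_len, db_residues, seconds, strands=1):
    """Dynamic-programming cells filled per second, in millions."""
    cells = query_len * db_residues * strands
    return cells / seconds / 1e6


rate = million_cell_updates_per_sec(
    QUERY_LENGTH, DB_RESIDUES, SEARCH_SECONDS, STRANDS
)
print(f"{rate:.3f} million cell updates/sec")
```

Under these assumptions the result lands within a fraction of a unit of the printed 14410.097; the small difference is what one would expect if the true search time was not exactly 660 seconds.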



Title:          US-10-529-592A-1
Perfect score:  881
Sequence:       1 gggccatgacccccgctgct..........aaataaagatcctctgtaac 881



Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0
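For orientation, the "sw model" with a gap-open and a gap-extension penalty corresponds to Smith-Waterman local alignment with affine gaps. The following is a minimal sketch of that technique (Gotoh's three-state recurrence), not the search engine's implementation. A gap of length L is charged gap_open + L * gap_ext, which matches the scores reported further down; the mismatch score of IDENTITY_NUC is not given in this report, so the default of -1.0 is a placeholder assumption, and the function name is hypothetical.

```python
def smith_waterman_affine(query, db_seq, match=1.0, mismatch=-1.0,
                          gap_open=10.0, gap_ext=1.0):
    """Return the best local alignment score with affine gap penalties.

    A gap of length L costs gap_open + L * gap_ext.
    `mismatch` is an assumed value; the report does not state it.
    """
    neg_inf = float("-inf")
    m = len(db_seq)
    h_prev = [0.0] * (m + 1)        # best score ending at (i-1, j)
    e_prev = [neg_inf] * (m + 1)    # best score ending in a gap that skips query bases
    best = 0.0
    for i in range(1, len(query) + 1):
        h_cur = [0.0] * (m + 1)
        e_cur = [neg_inf] * (m + 1)
        f = neg_inf                 # best score ending in a gap that skips db bases
        for j in range(1, m + 1):
            # Open a new gap from H, or extend an existing one.
            e_cur[j] = max(h_prev[j] - gap_open - gap_ext, e_prev[j] - gap_ext)
            f = max(h_cur[j - 1] - gap_open - gap_ext, f - gap_ext)
            s = match if query[i - 1] == db_seq[j - 1] else mismatch
            # Local alignment: never let the score drop below zero.
            h_cur[j] = max(0.0, h_prev[j - 1] + s, e_cur[j], f)
            best = max(best, h_cur[j])
        h_prev, e_prev = h_cur, e_cur
    return best


# Two 50-base flanks separated by a 20-base insertion in the query:
# 100 matches minus one gap of length 20 (10 + 20 * 1).
q = "A" * 50 + "G" * 20 + "C" * 50
d = "A" * 50 + "C" * 50
print(smith_waterman_affine(q, d))
```

The sketch keeps only two rows of the matrix, so it reports a score but no traceback; recovering the alignment itself would require storing the full matrices.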



Searched: 9073515 seqs, 5397694045 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :  N_Geneseq_200711:*
             1:  geneseqn1980s:*
             2:  geneseqn1990s:*
             3:  geneseqn2000:*
             4:  geneseqn2001a:*
             5:  geneseqn2001b:*
             6:  geneseqn2002a:*
             7:  geneseqn2002b:*
             8:  geneseqn2003a:*
             9:  geneseqn2003b:*
            10:  geneseqn2003c:*
            11:  geneseqn2003d:*
            12:  geneseqn2004a:*
            13:  geneseqn2004b:*
            14:  geneseqn2004c:*
            15:  geneseqn2004d:*
            16:  geneseqn2005a:*
            17:  geneseqn2005b:*
            18:  geneseqn2005c:*
            19:  geneseqn2006a:*
            20:  geneseqn2006b:*
            21:  geneseqn2006c:*
            22:  geneseqn2007:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result         Query
  No.   Score  Match  Length  DB  ID        Description
    1     881  100.0     881  12  ADM96967  Adm96967 Human pan
    2     881  100.0     881  22  AES72471  Aes72471 Human C19
    3     827   93.9     893  12  ADM96969  Adm96969 Human pan
    4     827   93.9     893  22  AES72473  Aes72473 Human C19
    5   703.2   79.8     761   4  AAK88446  Aak88446 Human dig
c   6   694.2   78.8     963   4  AAI90742  Aai90742 Human pol
c   7   694.2   78.8     963   9  ADE09694  Ade09694 Novel DNA
c   8     611   69.4     614  13  ADR26670  Adr26670 Breast ca
    9     549   62.3    2798   4  AAK90422  Aak90422 Human dig
   10   547.4   62.1    2804   4  AAK90423  Aak90423 Human dig
   11     367   41.7     574   6  ABL65256  Abl65256 Lung canc
   12   206.4   23.4    1617   9  ADE07493  Ade07493 Novel cod
   13     203   23.0     203   7  AFS82561  Afs82561 Human tra
   14   147.2   16.7     550  12  ACH77410  Ach77410 Human gen
   15     144   16.3     170  12  ACH91110  Ach91110 Human gen
   16   105.4   12.0    2721   6  ABS61407  Abs61407 Prostate
   17    75.4    8.6     763   6  ABQ27175  Abq27175 Oligonucl
c  18    75.4    8.6     763   6  ABQ27174  Abq27174 Oligonucl
   19    49.8    5.7    2000   8  ADA71938  Ada71938 Rice gene
c  20      49    5.6     763   6  ABQ27177  Abq27177 Oligonucl
   21      49    5.6     763   6  ABQ27176  Abq27176 Oligonucl
c  22      47    5.3    1140   8  ADB72831  Adb72831 Human LFN
c  23      47    5.3    1140   8  ADB72832  Adb72832 Human LFN
c  24      47    5.3    1140   9  ADA66377  Ada66377 Human LFN
c  25      47    5.3    1140   9  ADA66378  Ada66378 Human LFN
c  26      47    5.3    1140   9  ADA03094  Ada03094 Human LFN
c  27      47    5.3    1140   9  ADA03093  Ada03093 Human LFN
c  28      47    5.3    1140  10  ADL27172  Adl27172 Human cod
c  29      47    5.3    1377  16  ADZ12748  Adz12748 Human can
c  30      47    5.3    1545   2  AAV41906  Aav41906 Nucleotid
c  31      47    5.3    3025  16  ADZ12756  Adz12756 Human can
c  32      47    5.3   29040  11  ADL27170  Adl27170 Human gen
c  33      47    5.3   29322   8  ADB72830  Adb72830 Human LFN
c  34      47    5.3   29322   9  ADA66376  Ada66376 Human LFN
c  35      47    5.3   29322   9  ADA03092  Ada03092 Human LFN
c  36      47    5.3   30624  16  ADZ12747  Adz12747 Human can
   37    46.8    5.3    1074  12  ADT42247  Adt42247 Bacterial
   38    46.4    5.3     427   2  AAV62163  Aav62163 HSV-2 str
   39    46.4    5.3   16812   2  AAV62175  Aav62175 HSV-2 str
c  40    46.2    5.2     726  16  AEH47721  Aeh47721 Environme
c  41    45.6    5.2    1541  19  AEI61030  Aei61030 Zea mays
   42      45    5.1     768  14  AFR16600  Afr16600 Recombina
   43    44.8    5.1     673  14  AFR20003  Afr20003 Recombina
   44    44.4    5.0    1093  13  ADT18801  Adt18801 Plant cDN
c  45    44.4    5.0    2449  20  AFK76669  Afk76669 Polymorph

http://es/ScoreAccessWeb/GetItem.action?AppId=1052959...10_005435_
-10-529-592a-l.rng&ItemType=4&startByte=0 (2 of 44)6/7/2008 9:10:21 PM
SCORE Search Results Details for Application 10529592 and Search Result 20080410_005435_us-10-529-592a-l.rng.




ALIGNMENTS 



RESULT 1 
ADM96967
ID   ADM96967 standard; cDNA; 881 BP.
XX
AC   ADM96967;
XX
DT   01-JUL-2004 (first entry)
XX
DE   Human pancreatic cancer upregulated gene C1958V1.
XX
KW   ds; gene; cytostatic; gene therapy; pancreatic cancer; diagnosis;
KW   anti-tumor immunity.
XX
OS   Homo sapiens.
XX
FH   Key             Location/Qualifiers
FT   CDS             163..393
FT                   /*tag= a
FT                   /product= "C1958V1 protein"
XX
PN   WO2004031411-A2.
XX
PD   15-APR-2004.
XX
PF   12-SEP-2003; 2003WO-JP011713.
XX
PR   30-SEP-2002; 2002US-0414872P.
PR   28-FEB-2003; 2003US-0450889P.
XX
PA   (ONCO-) ONCOTHERAPY SCI INC.
PA   (UYTY ) UNIV TOKYO.
XX
PI   Nakamura Y, Katagiri T;
XX
DR   WPI; 2004-330204/30.
DR   P-PSDB; ADM96968.
XX
PT   New C1958V1 or C1958V2 polypeptides, useful in useful in diagnosing and
PT   treating pancreatic cancer and in inducing anti tumor immunity.
XX
PS   Claim 2; SEQ ID NO 1; 71pp; English.
XX
CC   The invention relates to the isolation of novel genes upregulated in
CC   pancreatic cancer designated C1958V1 and C1958V2, their encoded
CC   polypeptides (I), a sequence in which one or more amino acids are
CC   substituted, deleted, inserted, and/or added and that has a biological
CC   activity equivalent to the C1958V1 or C1958V2 proteins; or a sequence
CC   encoded by a polynucleotide that hybridizes under stringent conditions to
CC   the C1958V1 or C1958V2 polynucleotides. The polypeptides and
CC   polynucleotides, compounds and compositions are useful in diagnosing and
CC   treating pancreatic cancer and in inducing anti tumor immunity. This
CC   sequence represents the C1958V1 cDNA sequence.
XX
SQ   Sequence 881 BP; 178 A; 276 C; 264 G; 163 T; 0 U; 0 Other;

Query Match 100.0%; Score 881; DB 12; Length 881;
Best Local Similarity 100.0%; Pred. No. 1.6e-206;
Matches 881; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 60

Qy         61 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 120

Qy        121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 180

Qy        181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 240

Qy        241 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 300

Qy        301 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 360

Qy        361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 420

Qy        421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 480

Qy        481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 540

Qy        541 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 600

Qy        601 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 660

Qy        661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 720

Qy        721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 780

Qy        781 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 840

Qy        841 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881
              |||||||||||||||||||||||||||||||||||||||||
Db        841 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881

RESULT 2 
AES72471
ID   AES72471 standard; cDNA; 881 BP.
XX
AC   AES72471;
XX
DT   03-MAY-2007 (first entry)
XX
DE   Human C1958 splice variant 1, cDNA.
XX
KW   Pancreatic ductal adenocarcinoma; cancer; cytostatic; tumor marker;
KW   protein therapy; screening; splice variant; ss; gene; C1958V1; apoptosis;
KW   gene therapy; pancreas tumor; lung tumor; renal tumor; testicle tumor.
XX
OS   Homo sapiens.
XX
FH   Key             Location/Qualifiers
FT   CDS             163..393
FT                   /*tag= a
FT                   /product= "C1958V1 protein"
XX
PN   WO2007013358-A2.
XX
PD   01-FEB-2007.
XX
PF   14-JUL-2006; 2006WO-JP314442.
XX
PR   28-JUL-2005; 2005US-0703791P.
XX
PA   (ONCO-) ONCOTHERAPY SCI INC.
PA   (UYTY ) UNIV TOKYO.
XX
PI   Nakamura Y, Katagiri T, Inaki K;
XX
DR   WPI; 2007-283242/27.
DR   P-PSDB; AES72472.
XX
PT   New VIVIT polypeptide useful for treating or preventing cancer, such as
PT   pancreatic cancer, lung cancer, kidney cancer and testicular tumor.
XX
PS   Disclosure; SEQ ID NO 1; 78pp; English.
XX
CC   The invention relates to a VIVIT polypeptide (AES72497) and at least a
CC   fragment of the human C1958 sequence appearing as AES72472, in which
CC   residues at positions 37-41 is replaced with AES72497, or an amino acid
CC   sequence of a polypeptide functionally equivalent to the polypeptide
CC   comprising the fragment sequence. Also included are an agent for
CC   treating/preventing cancer (comprising as an active ingredient the VIVIT
CC   polypeptide, or a polynucleotide encoding the polypeptide), a
CC   pharmaceutical composition (comprising the VIVIT polypeptide, and a
CC   carrier), screening (M1) for a compound useful in treating/preventing
CC   cancers (involving (a) contacting a polypeptide comprising a PPP3CA-
CC   binding domain of a C1958 polypeptide with a polypeptide comprising a
CC   C1958-binding domain of a PPP3CA polypeptide in the presence of a test
CC   compound, (b) detecting binding between the polypeptides, and (c)
CC   selecting a test compound that inhibits binding between the
CC   polypeptides), a kit for screening for a compound for treating or
CC   preventing cancers, treating/preventing cancers in a subject (involving
CC   administering the compound selected above), and inducing apoptosis in a
CC   cell (involving introducing a polypeptide having a dominant-negative
CC   effect against C1958 or a polynucleotide encoding the polypeptide into
CC   the cell, where the polypeptide comprises a fragment sequence having
CC   AES72497, or the mutated C1958 above). The polypeptide is modified with a
CC   cell-membrane permeable substance, which has the general formula [R]-[D],
CC   where [R] represents the cell-membrane permeable substance, and [D]
CC   represents the amino acid sequence of a fragment sequence of AES72497, or
CC   the mutated C1958 peptide. The VIVIT polypeptide is useful for treating
CC   and/or preventing cancer, preferably pancreatic cancer (especially
CC   Pancreatic ductal adenocarcinoma), lung cancer, kidney cancer and
CC   testicular tumor. (M1) is useful for screening a compound for treating or
CC   preventing cancers. The present sequence encodes a splice variant of
CC   human C1958.
XX
SQ   Sequence 881 BP; 178 A; 276 C; 264 G; 163 T; 0 U; 0 Other;

Query Match 100.0%; Score 881; DB 22; Length 881;
Best Local Similarity 100.0%; Pred. No. 1.6e-206;
Matches 881; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 60

Qy         61 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 120

Qy        121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 180

Qy        181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 240

Qy        241 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 300

Qy        301 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 360

Qy        361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 420

Qy        421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 480

Qy        481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 540

Qy        541 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 600

Qy        601 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 660

Qy        661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 720

Qy        721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 780

Qy        781 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 840

Qy        841 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881
              |||||||||||||||||||||||||||||||||||||||||
Db        841 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881



RESULT 3 
ADM96969
ID   ADM96969 standard; cDNA; 893 BP.
XX
AC   ADM96969;
XX
DT   01-JUL-2004 (first entry)
XX
DE   Human pancreatic cancer upregulated gene C1958V2.
XX
KW   ds; gene; cytostatic; gene therapy; pancreatic cancer; diagnosis;
KW   anti-tumor immunity.
XX
OS   Homo sapiens.
XX
FH   Key             Location/Qualifiers
FT   CDS             197..259
FT                   /*tag= a
FT                   /product= "C1958V2 protein"
XX
PN   WO2004031411-A2.
XX
PD   15-APR-2004.
XX
PF   12-SEP-2003; 2003WO-JP011713.
XX
PR   30-SEP-2002; 2002US-0414872P.
PR   28-FEB-2003; 2003US-0450889P.
XX
PA   (ONCO-) ONCOTHERAPY SCI INC.
PA   (UYTY ) UNIV TOKYO.
XX
PI   Nakamura Y, Katagiri T;
XX
DR   WPI; 2004-330204/30.
DR   P-PSDB; ADM96970.
XX
PT   New C1958V1 or C1958V2 polypeptides, useful in useful in diagnosing and
PT   treating pancreatic cancer and in inducing anti tumor immunity.
XX
PS   Claim 2; SEQ ID NO 3; 71pp; English.
XX
CC   The invention relates to the isolation of novel genes upregulated in
CC   pancreatic cancer designated C1958V1 and C1958V2, their encoded
CC   polypeptides (I), a sequence in which one or more amino acids are
CC   substituted, deleted, inserted, and/or added and that has a biological
CC   activity equivalent to the C1958V1 or C1958V2 proteins; or a sequence
CC   encoded by a polynucleotide that hybridizes under stringent conditions to
CC   the C1958V1 or C1958V2 polynucleotides. The polypeptides and
CC   polynucleotides, compounds and compositions are useful in diagnosing and
CC   treating pancreatic cancer and in inducing anti tumor immunity. This
CC   sequence represents the C1958V2 cDNA sequence.
XX
SQ   Sequence 893 BP; 175 A; 287 C; 274 G; 157 T; 0 U; 0 Other;

Query Match 93.9%; Score 827; DB 12; Length 893;
Best Local Similarity 97.5%; Pred. No. 3.2e-193;
Matches 859; Conservative 0; Mismatches 0; Indels 22; Gaps 1;

Qy          1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         35 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 94

Qy         61 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         95 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 154

Qy        121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        155 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 214

Qy        181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 240
              ||||||||                      ||||||||||||||||||||||||||||||
Db        215 TGCCTGAA----------------------AGCAGCAGCAGCAGCCACGACGAGGCCCCC 252

Qy        241 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        253 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 312

Qy        301 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        313 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 372

Qy        361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        373 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 432

Qy        421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        433 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 492

Qy        481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        493 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 552

Qy        541 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        553 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 612

Qy        601 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        613 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 672

Qy        661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        673 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 732

Qy        721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        733 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 792

Qy        781 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        793 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 852

Qy        841 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881
              |||||||||||||||||||||||||||||||||||||||||
Db        853 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 893
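
The statistics printed for Result 3 fit together arithmetically, and the sketch below makes that explicit. It assumes each match scores 1, a gap costs Gapop plus Gapext per gapped position, Query Match is the score as a percentage of the perfect score, and Best Local Similarity is matches divided by aligned columns. These relationships are inferred from the numbers in this report rather than taken from the program's documentation, and the function name is hypothetical. Hits with mismatches are rejected because the mismatch score of IDENTITY_NUC is not stated.

```python
def alignment_stats(matches, mismatches, indels, gaps, perfect_score,
                    gap_open=10.0, gap_ext=1.0, match_score=1.0):
    """Recompute (score, query match %, best local similarity %) for a hit."""
    if mismatches:
        # The report does not give the mismatch score, so do not guess it.
        raise ValueError("mismatch score is not stated in the report")
    score = matches * match_score - (gaps * gap_open + indels * gap_ext)
    query_match = 100.0 * score / perfect_score
    aligned_columns = matches + mismatches + indels
    similarity = 100.0 * matches / aligned_columns
    return score, round(query_match, 1), round(similarity, 1)


# Result 3: Matches 859; Mismatches 0; Indels 22; Gaps 1; perfect score 881.
print(alignment_stats(859, 0, 22, 1, 881))   # (827.0, 93.9, 97.5)
# Result 1: a full-length identical hit.
print(alignment_stats(881, 0, 0, 0, 881))    # (881.0, 100.0, 100.0)
```

Both outputs agree with the Score, Query Match and Best Local Similarity lines printed for those results.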



RESULT 4 
AES72473
ID   AES72473 standard; cDNA; 893 BP.
XX
AC   AES72473;
XX
DT   03-MAY-2007 (first entry)
XX
DE   Human C1958 splice variant 3, cDNA.
XX
KW   Pancreatic ductal adenocarcinoma; cancer; cytostatic; tumor marker;
KW   protein therapy; screening; splice variant; ss; gene; C1958V3; apoptosis;
KW   gene therapy; pancreas tumor; lung tumor; renal tumor; testicle tumor.
XX
OS   Homo sapiens.
XX
FH   Key             Location/Qualifiers
FT   CDS             197..259
FT                   /*tag= a
FT                   /product= "C1958V3"
XX
PN   WO2007013358-A2.
XX
PD   01-FEB-2007.
XX
PF   14-JUL-2006; 2006WO-JP314442.
XX
PR   28-JUL-2005; 2005US-0703791P.
XX
PA   (ONCO-) ONCOTHERAPY SCI INC.
PA   (UYTY ) UNIV TOKYO.
XX
PI   Nakamura Y, Katagiri T, Inaki K;
XX
DR   WPI; 2007-283242/27.
DR   P-PSDB; AES72474.
XX
PT   New VIVIT polypeptide useful for treating or preventing cancer, such as
PT   pancreatic cancer, lung cancer, kidney cancer and testicular tumor.
XX
PS   Disclosure; SEQ ID NO 3; 78pp; English.
XX
CC   The invention relates to a VIVIT polypeptide (AES72497) and at least a
CC   fragment of the human C1958 sequence appearing as AES72472, in which
CC   residues at positions 37-41 is replaced with AES72497, or an amino acid
CC   sequence of a polypeptide functionally equivalent to the polypeptide
CC   comprising the fragment sequence. Also included are an agent for
CC   treating/preventing cancer (comprising as an active ingredient the VIVIT
CC   polypeptide, or a polynucleotide encoding the polypeptide), a
CC   pharmaceutical composition (comprising the VIVIT polypeptide, and a
CC   carrier), screening (M1) for a compound useful in treating/preventing
CC   cancers (involving (a) contacting a polypeptide comprising a PPP3CA-
CC   binding domain of a C1958 polypeptide with a polypeptide comprising a
CC   C1958-binding domain of a PPP3CA polypeptide in the presence of a test
CC   compound, (b) detecting binding between the polypeptides, and (c)
CC   selecting a test compound that inhibits binding between the
CC   polypeptides), a kit for screening for a compound for treating or
CC   preventing cancers, treating/preventing cancers in a subject (involving
CC   administering the compound selected above), and inducing apoptosis in a
CC   cell (involving introducing a polypeptide having a dominant-negative
CC   effect against C1958 or a polynucleotide encoding the polypeptide into
CC   the cell, where the polypeptide comprises a fragment sequence having
CC   AES72497, or the mutated C1958 above). The polypeptide is modified with a
CC   cell-membrane permeable substance, which has the general formula [R]-[D],
CC   where [R] represents the cell-membrane permeable substance, and [D]
CC   represents the amino acid sequence of a fragment sequence of AES72497, or
CC   the mutated C1958 peptide. The VIVIT polypeptide is useful for treating
CC   and/or preventing cancer, preferably pancreatic cancer (especially
CC   Pancreatic ductal adenocarcinoma), lung cancer, kidney cancer and
CC   testicular tumor. (M1) is useful for screening a compound for treating or
CC   preventing cancers. The present sequence encodes a splice variant of
CC   human C1958.
XX
SQ   Sequence 893 BP; 175 A; 287 C; 274 G; 157 T; 0 U; 0 Other;

Query Match 93.9%; Score 827; DB 22; Length 893;
Best Local Similarity 97.5%; Pred. No. 3.2e-193;
Matches 859; Conservative 0; Mismatches 0; Indels 22; Gaps 1;

Qy 1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 35 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 94 

Qy 61 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 120 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 95 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 15 4 



Qy 121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 15 5 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 214 

Qy 181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 215 TGCCTGAA AGCAGCAGCAGCAGCCACGACGAGGCCCCC 252 

Qy 241 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 253 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 312 

Qy 301 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 313 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 3 72 

Qy 361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 73 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 432 

Qy 421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 433 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 492 

Qy 481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 5 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 493 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 552 

Qy 541 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 6 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 553 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 612 

Qy 601 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 66 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 613 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 6 72 

Qy 661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 6 73 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 732 

Qy 721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 733 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 792 

Qy 781 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 8 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 793 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 852 
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Qy 841 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 853 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 893



RESULT 5 
AAK88446 



ID AAK88446 standard; cDNA; 761 BP.
XX
AC AAK88446;
XX
DT 05-NOV-2001 (first entry)
XX
DE Human digestive system antigen coding sequence SEQ ID NO: 762.
XX
KW Human; digestive system antigen; gene therapy; cancer; appendicitis;
KW ulcerative colitis; infection; Hirschsprung's disease; chronic colitis;
KW digestive system disorder; Meckel's diverticulum; ss.
XX
OS Homo sapiens.
XX
PN WO200155314-A2.
XX
PD 02-AUG-2001.
XX
PF 17-JAN-2001; 2001WO-US001324.
XX
PR 31-JAN-2000; 2000US-0179065P.
PR 04-FEB-2000; 2000US-0180628P.
PR 24-FEB-2000; 2000US-0184664P.
PR 02-MAR-2000; 2000US-0186350P.
PR 16-MAR-2000; 2000US-0189874P.
PR 17-MAR-2000; 2000US-0190076P.
PR 18-APR-2000; 2000US-0198123P.
PR 19-MAY-2000; 2000US-0205515P.
PR 07-JUN-2000; 2000US-0209467P.
PR 28-JUN-2000; 2000US-0214886P.
PR 30-JUN-2000; 2000US-0215135P.
PR 07-JUL-2000; 2000US-0216647P.
PR 07-JUL-2000; 2000US-0216880P.
PR 11-JUL-2000; 2000US-0217487P.
PR 11-JUL-2000; 2000US-0217496P.
PR 14-JUL-2000; 2000US-0218290P.
PR 26-JUL-2000; 2000US-0220963P.
PR 26-JUL-2000; 2000US-0220964P.
PR 14-AUG-2000; 2000US-0224518P.
PR 14-AUG-2000; 2000US-0224519P.
PR 14-AUG-2000; 2000US-0225213P.
PR 14-AUG-2000; 2000US-0225214P.
PR 14-AUG-2000; 2000US-0225266P.
PR 14-AUG-2000; 2000US-0225267P.
PR 14-AUG-2000; 2000US-0225268P.
PR 14-AUG-2000; 2000US-0225270P.
PR 14-AUG-2000; 2000US-0225447P.
PR 14-AUG-2000; 2000US-0225757P.
PR 14-AUG-2000; 2000US-0225758P.
PR 14-AUG-2000; 2000US-0225759P.
PR 18-AUG-2000; 2000US-0226279P.
PR 22-AUG-2000; 2000US-0226681P.
PR 22-AUG-2000; 2000US-0226868P.
PR 22-AUG-2000; 2000US-0227182P.
PR 23-AUG-2000; 2000US-0227009P.
PR 30-AUG-2000; 2000US-0228924P.
PR 01-SEP-2000; 2000US-0229287P.
PR 01-SEP-2000; 2000US-0229343P.
PR 01-SEP-2000; 2000US-0229344P.
PR 01-SEP-2000; 2000US-0229345P.
PR 05-SEP-2000; 2000US-0229509P.
PR 05-SEP-2000; 2000US-0229513P.
PR 06-SEP-2000; 2000US-0230437P.
PR 06-SEP-2000; 2000US-0230438P.
PR 08-SEP-2000; 2000US-0231242P.
PR 08-SEP-2000; 2000US-0231243P.
PR 08-SEP-2000; 2000US-0231244P.
PR 08-SEP-2000; 2000US-0231413P.
PR 08-SEP-2000; 2000US-0231414P.
PR 08-SEP-2000; 2000US-0232080P.
PR 08-SEP-2000; 2000US-0232081P.
PR 12-SEP-2000; 2000US-0231968P.
PR 14-SEP-2000; 2000US-0232397P.
PR 14-SEP-2000; 2000US-0232398P.
PR 14-SEP-2000; 2000US-0232399P.
PR 14-SEP-2000; 2000US-0232400P.
PR 14-SEP-2000; 2000US-0232401P.
PR 14-SEP-2000; 2000US-0233063P.
PR 14-SEP-2000; 2000US-0233064P.
PR 14-SEP-2000; 2000US-0233065P.
PR 21-SEP-2000; 2000US-0234223P.
PR 21-SEP-2000; 2000US-0234274P.
PR 25-SEP-2000; 2000US-0234997P.
PR 25-SEP-2000; 2000US-0234998P.
PR 26-SEP-2000; 2000US-0235484P.
PR 27-SEP-2000; 2000US-0235834P.
PR 2000US-0235836P.
PR 2000US-0236327P.
PR 2000US-0236367P.
PR 2000US-0236368P.
PR 2000US-0236369P.
PR 2000US-0236370P.
PR 2000US-0236802P.
PR 2000US-0237037P.
PR 2000US-0237038P.
PR 2000US-0237039P.
PR 2000US-0237040P.
PR 2000US-0239935P.
PR 2000US-0239937P.
PR 2000US-0240960P.
PR 2000US-0241221P.
PR 2000US-0241785P.
PR 2000US-0241786P.
PR 2000US-0241787P.
PR 2000US-0241808P.
PR 2000US-0241809P.
PR 2000US-0241826P.
PR 2000US-0244617P.
PR 2000US-0246474P.
PR 2000US-0246475P.
PR 2000US-0246476P.
PR 2000US-0246477P.
PR 2000US-0246478P.
PR 2000US-0246523P.
PR 2000US-0246524P.
PR 2000US-0246525P.
PR 2000US-0246526P.
PR 2000US-0246527P.
PR 2000US-0246528P.
PR 2000US-0246532P.
PR 2000US-0246609P.
PR 2000US-0246610P.
PR 2000US-0246611P.
PR 2000US-0246613P.
PR 2000US-0249207P.
PR 2000US-0249208P.
PR 2000US-0249209P.
PR 2000US-0249210P.
PR 2000US-0249211P.
PR 2000US-0249212P.
PR 2000US-0249213P.
PR 2000US-0249214P.
PR 2000US-0249215P.
PR 2000US-0249216P.
PR 2000US-0249217P.
PR 2000US-0249218P.
PR 2000US-0249244P.
PR 2000US-0249245P.
PR 2000US-0249264P.
PR 17-NOV-2000; 2000US-0249265P.
PR 17-NOV-2000; 2000US-0249297P.
PR 17-NOV-2000; 2000US-0249299P.
PR 17-NOV-2000; 2000US-0249300P.
PR 01-DEC-2000; 2000US-0250160P.
PR 01-DEC-2000; 2000US-0250391P.
PR 05-DEC-2000; 2000US-0251030P.
PR 05-DEC-2000; 2000US-0251988P.
PR 05-DEC-2000; 2000US-0256719P.
PR 06-DEC-2000; 2000US-0251479P.
PR 08-DEC-2000; 2000US-0251856P.
PR 08-DEC-2000; 2000US-0251868P.
PR 08-DEC-2000; 2000US-0251869P.
PR 08-DEC-2000; 2000US-0251989P.
PR 08-DEC-2000; 2000US-0251990P.
PR 11-DEC-2000; 2000US-0254097P.
PR 05-JAN-2001; 2001US-0259678P.
XX
PA (HUMA-) HUMAN GENOME SCI INC.
XX
PI Rosen CA, Barash SC, Ruben SM;
XX
DR WPI; 2001-502630/55.
DR P-PSDB; AAM92673.
XX
PT Polynucleotides encoding digestive system antigens, useful for
PT diagnosing, treating, preventing and/or prognozing disorders of the
PT digestive system, particularly cancer and cancer metastases.
XX
PS Claim 1; SEQ ID NO 762; 986pp; English.
XX
CC The present invention provides the protein and coding sequences of a
CC number of human digestive system antigens. These can be used in the
CC diagnosis, treatment and prevention of digestive system disorders,
CC including cancer, Meckel's diverticulum, bacterial or parasitic
CC infections, appendicitis, Hirschsprung's disease, chronic colitis or
CC ulcerative colitis. The present sequence is a cDNA encoding a digestive
CC system antigen of the invention
XX
SQ Sequence 761 BP; 183 A; 208 C; 218 G; 150 T; 0 U; 2 Other;

Query Match 79.8%; Score 703.2; DB 4; Length 761;

Best Local Similarity 99.3%; Pred. No. 9.4e-163;

Matches 735; Conservative 2; Mismatches 0; Indels 3; Gaps 3;



Qy 142 GCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCCTGCCTGAAAGGCTTTCAAATG 201
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 GCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCCTGCCTGAAA-GCTTTCAAATG 59

Qy 202 TGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTG 261

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I I I I I I 
Db 60 TGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGG-CCCCGTCCTGAAMGWCAAGCACCTG 118

Qy 262 GACGTGCCCGACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGAC 321

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 119 GACGTGCCCGACATCATCATCACGCCCCCCA-CCCCACGGGCATGATGCTGCCGAGGGAC 177 

Qy 322 TTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGAC 381

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 178 TTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGAC 237

Qy 382 CCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGC 441 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 238 CCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGC 297

Qy 442 TTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAAC 501 

I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 298 TTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAAC 357

Qy 502 TGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTG 561 

I I I I I I I I I I I I I I I I I I I I I I II I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 358 TGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTG 417

Qy 562 TCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCC 621

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 418 TCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCC 477 

Qy 622 AGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCC 681

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 478 AGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCC 537

Qy 682 CAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACAC 741

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 538 CAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACAC 597

Qy 742 ATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAA 801 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 598 ATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAA 657

Qy 802 GTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTAT 861 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 658 GTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTAT 717

Qy 862 AAATAAAGATCCTCTGTAAC 881

I I I I I I I I I I I I I I I I I I I I 
Db 718 AAATAAAGATCCTCTGTAAC 737 

RESULT 6
AAI90742/c

ID AAI90742 standard; cDNA; 963 BP. 

XX 

AC AAI90742; 
XX 

DT 06-NOV-2001 (first entry) 
XX 

DE Human polynucleotide SEQ ID NO 10802. 
XX 

KW Human; cytokine; cell proliferation; cell differentiation; gene therapy; 

KW vaccine; peptide therapy; stem cell growth factor; haematopoiesis; 

KW tissue growth factor; immunomodulatory; cancer; leukaemia; 

KW nervous system disorders; arthritis; inflammation; ss. 

XX 

OS Homo sapiens . 
XX 

PN WO200164835-A2 . 
XX 

PD 07-SEP-2001. 

XX 

PF 26-FEB-2001; 2001WO-US004927.
XX 

PR 28-FEB-2000; 2000US-00515126 . 

PR 18-MAY-2000; 2000US-00577409 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Drmanac RT; 
XX 

DR WPI; 2001-514838/56. 

DR P-PSDB; AAO10811. 

XX 

PT Isolated nucleic acids and polypeptides, useful for preventing diagnosing 

PT and treating e.g. leukemia, inflammation and immune disorders. 

XX 

PS Claim 1; SEQ ID NO 10802; 1399pp + Sequence Listing; English. 

XX 

CC The invention relates to human polynucleotides (AAI79941-AAI93841) and

CC the encoded proteins (AAO00010-AAO13910) that exhibit activity relating to

CC cytokine, cell proliferation or cell differentiation or which may induce 

CC production of other cytokines in other cell populations . The 

CC polynucleotides and polypeptides are useful in gene therapy, vaccines or 

CC peptide therapy. The polypeptides have various cytokine-like activities, 

CC e.g. stem cell growth factor activity, haematopoiesis regulating 

CC activity, tissue growth factor activity, immunomodulatory activity and 

CC activin/inhibin activity and may be useful in the diagnosis and/or 

CC treatment of cancer, leukaemia, nervous system disorders, arthritis and 

CC inflammation. Note: The sequence data for this patent did not form part 

CC of the printed specification, but was obtained in electronic format

CC directly from WIPO at ftp.wipo.int/pub/published_pct_sequences 

XX 

SQ Sequence 963 BP; 202 A; 296 C; 269 G; 194 T; 0 U; 2 Other; 

Query Match 78.8%; Score 694.2; DB 4; Length 963; 

Best Local Similarity 98.9%; Pred. No. 1.7e-160; 

Matches 699; Conservative 0; Mismatches 8; Indels 0; Gaps 0; 

Qy 175 ATGTCCTGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAG 234

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 715 ACGCCTGGCTTCTCAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAG 656 



Qy 235 GCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACC 294 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 655 GCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACC 596



Qy 295 CCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGG 354

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 595 CCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGG 536 



Qy 355 TCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCT 414

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 535 TCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCT 476 



Qy 415 GGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCA 474 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 475 GGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCA 416 



Qy 475 GGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGT 534

I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 415 GGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGT 356 



Qy 535 GGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGC 594 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 355 GGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGC 296



Qy 595 TGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGA 654 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 295 TGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGA 236 



Qy 655 GCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCC 714 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 235 GCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCC 176 



Qy 715 TCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCT 774

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 175 TCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCT 116 

Qy 775 TGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATG 834

Db 115 TGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATG 56

Qy 835 GGATTTATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881

Db 55 GGATTTATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 9



RESULT 7 
ADE09694/c 

ID ADE09694 standard; DNA; 963 BP. 
XX 

AC ADE09694; 
XX 

DT 29-JAN-2004 (first entry) 
XX 

DE Novel DNA-related contig nucleotide sequence #416. 
XX 

KW novel gene; novel protein; tissue marker; molecular weight marker; 

KW chromosome marker; genetic disorder; contig; ds . 

XX 

OS Unidentified. 
XX 

PN WO2003054152-A2 . 
XX 

PD 03-JUL-2003. 

XX 

PF 10-DEC-2002; 2002WO-US039555 . 
XX 

PR 10-DEC-2001; 2001US-0339739P .

PR 11-DEC-2001; 2001US-0339453P .

PR 14-MAR-2002; 2002US-0365091P . 

PR 14-MAR-2002; 2002US-0365384P . 

PR 12-APR-2002; 2002US-0372381P . 

PR 12-APR-2002; 2002US-0372615P . 

PR 22-APR-2002; 2002US-00128558 . 

PR 24-APR-2002; 2002US-0376045P .
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Asundi V, Goodrich RW, Ren F, Zhang J, Zhao QA, Wang J; 

PI Ghosh M, Xue AJ, Wehrman T, Weng G, Zhou P, Drmanac RT, Wang Z; 

PI Ma Y, Wang D, Chen R, Xu C, Boyle BJ; 
XX 

DR WPI; 2003-569235/53. 
XX 

PT New polynucleotides, useful for expressing recombinant proteins for 

PT analysis, characterization or therapeutic use, or as markers for tissues 

PT in which the corresponding protein is preferentially expressed.
XX 

PS Disclosure; SEQ ID NO 2238; 1177pp; English. 

XX 

CC The invention comprises the amino acid and coding sequences of novel 

CC proteins. The DNA and protein sequences of the invention are useful as: 

CC markers for tissues in which the corresponding protein is preferentially 

CC expressed; as molecular weight markers on gels; as chromosome markers or 

CC tags; to identify chromosomes or to map related gene positions; and to 

CC compare with endogenous DNA sequences in patients to identify potential 

CC genetic disorders. The present DNA sequence was used in the 

CC exemplification of the invention. 
XX 

SQ Sequence 963 BP; 202 A; 296 C; 269 G; 194 T; 0 U; 2 Other; 

Query Match 78.8%; Score 694.2; DB 9; Length 963; 

Best Local Similarity 98.9%; Pred. No. 1.7e-160; 

Matches 699; Conservative 0; Mismatches 8; Indels 0; Gaps 0;

Qy 175 ATGTCCTGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAG 234

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 715 ACGCCTGGCTTCTCAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAG 656

Qy 235 GCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACC 294 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 655 GCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACC 596

Qy 295 CCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGG 354

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 595 CCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGG 536 

Qy 355 TCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCT 414

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 535 TCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCT 476 

Qy 415 GGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCA 474 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 475 GGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCA 416 

Qy 475 GGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGT 534

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 415 GGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGT 356 

Qy 535 GGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGC 594 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 355 GGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGC 296

Qy 595 TGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGA 654 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 295 TGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGA 236



Qy 655 GCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCC 714

Db 235 GCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCC 176

Qy 715 TCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCT 774

Db 175 TCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCT 116

Qy 775 TGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATG 834

Db 115 TGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATG 56

Qy 835 GGATTTATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881

Db 55 GGATTTATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 9



RESULT 8 
ADR26670/c

ID ADR26670 standard; DNA; 614 BP. 
XX 

AC ADR26670; 
XX 

DT 21-OCT-2004 (first entry) 
XX 

DE Breast cancer prognosis marker #2531. 
XX 

KW ds; breast cancer; prognosis; gene expression; diagnosis. 
XX 

OS Homo sapiens . 
XX 

PN WO2004065545-A2 . 
XX 

PD 05-AUG-2004. 

XX 

PF 15-JAN-2004; 2004WO-US001100 . 
XX 

PR 15-JAN-2003; 2003US-00342887 . 
XX 

PA (ROSE-) ROSETTA INPHARMATICS LLC. 

PA (NECA-) NETHERLANDS CANCER INST. 
XX 

PI Van't Veer LJ, He Y; 
XX 

DR WPI; 2004-593473/57. 
XX 

PT Classifying a breast cancer patient according to prognosis comprises 

PT determining the similarity between the level of expression of each of

PT five genes in a cell sample taken from patient, to control levels. 
XX 

PS Disclosure; SEQ ID NO 2531; 226pp; English. 

XX 

CC The invention relates to a method of classifying a breast cancer patient 

CC according to prognosis by determining the similarity between the level of 

CC expression of each of five genes for which markers are listed in the 

CC specification, in a cell sample taken from the breast cancer patient, to 

CC control levels of expression for each respective five genes to obtain a 

CC patient similarity value. The methods are useful for classifying a breast 

CC cancer patient according to prognosis. Kits and computer program products 

CC are useful for data analysis using the diagnostic, prognostic and 

CC statistical methods of the invention. This sequence corresponds to a 

CC marker used in the method of the invention. 

XX 

SQ Sequence 614 BP; 130 A; 179 C; 171 G; 134 T; 0 U; 0 Other; 



Query Match 69.4%; Score 611; DB 13; Length 614;

Best Local Similarity 100.0%; Pred. No. 4.4e-140; 

Matches 611; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 271 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 330

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 614 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 555 



Qy 331 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 390 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 554 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 495



Qy 391 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 450 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 494 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 435 



Qy 451 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 510 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 434 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 375



Qy 511 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 570

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 374 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 315



Qy 571 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 630

I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 314 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 255 



Qy 631 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 690

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 254 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 195 

Qy 691 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 750

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 194 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 135 

Qy 751 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 810 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 134 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 75

Qy 811 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 870

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 74 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 15

Qy 871 TCCTCTGTAAC 881

I I I I I I I I I I I 
Db 14 TCCTCTGTAAC 4 



RESULT 9 
AAK90422 

ID AAK90422 standard; DNA; 2798 BP. 
XX 

AC AAK90422;
XX 

DT 05-NOV-2001 (first entry) 
XX 

DE Human digestive system antigen genomic sequence SEQ ID NO: 3998. 
XX 

KW Human; digestive system antigen; gene therapy; cancer; appendicitis; 

KW ulcerative colitis; infection; Hirschsprung's disease; chronic colitis; 

KW digestive system disorder; Meckel's diverticulum; ds . 
XX 

OS Homo sapiens . 
XX 

PN WO200155314-A2 . 
XX 

PD 02-AUG-2001. 

XX 

PF 17-JAN-2001; 2001WO-US001324 . 
XX 

PR 31-JAN-2000; 2000US-0179065P . 

PR 04-FEB-2000; 2000US-0180628P . 

PR 24-FEB-2000; 2000US-0184664P .

PR 02-MAR-2000; 2000US-0186350P . 

PR 16-MAR-2000; 2000US-0189874P . 

PR 17-MAR-2000; 2000US-0190076P . 

PR 18-APR-2000; 2000US-0198123P . 

PR 19-MAY-2000; 2000US-0205515P . 

PR 07-JUN-2000; 2000US-0209467P .

2000US-0214886P.
2000US-0215135P. 
2000US-0216647P. 
2000US-0216880P. 
2000US-0217487P . 
2000US-0217496P. 
2000US-0218290P. 
2000US-0220963P. 
2000US-0220964P. 
2000US-0224518P. 
2000US-0224519P. 
2000US-0225213P. 
2000US-0225214P. 
2000US-0225266P. 
2000US-0225267P. 
2000US-0225268P. 
2000US-0225270P. 
2000US-0225447P. 
2000US-0225757P. 
2000US-0225758P. 
2000US-0225759P. 
2000US-0226279P. 
2000US-0226681P. 
2000US-0226868P. 
2000US-0227182P. 
2000US-0227009P. 
2000US-0228924P. 
2000US-0229287P. 
2000US-0229343P. 
2000US-0229344P. 
2000US-0229345P. 
2000US-0229509P. 
2000US-0229513P. 
2000US-0230437P. 
2000US-0230438P. 
2000US-0231242P. 
2000US-0231243P. 
2000US-0231244P. 
2000US-0231413P. 
2000US-0231414P. 
2000US-0232080P. 
2000US-0232081P. 
2000US-0231968P. 
2000US-0232397P. 
2000US-0232398P. 
2000US-0232399P. 
2000US-0232400P. 
2000US-0232401P. 
2000US-0233063P. 
2000US-0233064P.
2000US-0233065P. 
2000US-0234223P. 
2000US-0234274P. 
2000US-0234997P. 
2000US-0234998P. 
2000US-0235484P. 
2000US-0235834P. 
2000US-0235836P. 
2000US-0236327P. 
2000US-0236367P. 
2000US-0236368P. 
2000US-0236369P. 
2000US-0236370P. 
2000US-0236802P. 
2000US-0237037P. 
2000US-0237038P. 
2000US-0237039P. 
2000US-0237040P. 
2000US-0239935P. 
2000US-0239937P. 
2000US-0240960P. 
2000US-0241221P. 
2000US-0241785P. 
2000US-0241786P. 
2000US-0241787P. 
2000US-0241808P. 
2000US-0241809P. 
2000US-0241826P. 
2000US-0244617P. 
2000US-0246474P. 
2000US-0246475P. 
2000US-0246476P. 
2000US-0246477P. 
2000US-0246478P. 
2000US-0246523P. 
2000US-0246524P. 
2000US-0246525P. 
2000US-0246526P. 
2000US-0246527P. 
2000US-0246528P. 
2000US-0246532P. 
2000US-0246609P. 
2000US-0246610P. 
2000US-0246611P. 
2000US-0246613P. 
2000US-0249207P. 
2000US-0249208P. 
2000US-0249209P. 

PR 17-NOV-2000; 2000US-0249210P .

PR 17-NOV-2000; 2000US-0249211P . 

PR 17-NOV-2000; 2000US-0249212P . 

PR 17-NOV-2000; 2000US-0249213P . 

PR 17-NOV-2000; 2000US-0249214P . 

PR 17-NOV-2000; 2000US-0249215P . 

PR 17-NOV-2000; 2000US-0249216P . 

PR 17-NOV-2000; 2000US-0249217P .

PR 17-NOV-2000; 2000US-0249218P . 

PR 17-NOV-2000; 2000US-0249244P . 

PR 17-NOV-2000; 2000US-0249245P . 

PR 17-NOV-2000; 2000US-0249264P .

PR 17-NOV-2000; 2000US-0249265P . 

PR 17-NOV-2000; 2000US-0249297P .

PR 17-NOV-2000; 2000US-0249299P . 

PR 17-NOV-2000; 2000US-0249300P . 

PR 01-DEC-2000; 2000US-0250160P . 

PR 01-DEC-2000; 2000US-0250391P . 

PR 05-DEC-2000; 2000US-0251030P . 

PR 05-DEC-2000; 2000US-0251988P . 

PR 05-DEC-2000; 2000US-0256719P .

PR 06-DEC-2000; 2000US-0251479P . 

PR 08-DEC-2000; 2000US-0251856P . 

PR 08-DEC-2000; 2000US-0251868P . 

PR 08-DEC-2000; 2000US-0251869P . 

PR 08-DEC-2000; 2000US-0251989P . 

PR 08-DEC-2000; 2000US-0251990P . 

PR 11-DEC-2000; 2000US-0254097P .

PR 05-JAN-2001; 2001US-0259678P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Barash SC, Ruben SM; 
XX 

DR WPI; 2001-502630/55. 

XX 

PT Polynucleotides encoding digestive system antigens, useful for 

PT diagnosing, treating, preventing and/or prognozing disorders of the 

PT digestive system, particularly cancer and cancer metastases . 

XX 

PS Disclosure; SEQ ID NO 3998; 986pp; English. 

XX 

CC The present invention provides the protein and coding sequences of a 

CC number of human digestive system antigens. These can be used in the 

CC diagnosis, treatment and prevention of digestive system disorders, 

CC including cancer, Meckel's diverticulum, bacterial or parasitic 

CC infections, appendicitis, Hirschsprung's disease, chronic colitis or 

CC ulcerative colitis. The present sequence is a genomic DNA fragment 

CC encoding a digestive system antigen of the invention 

XX

SQ Sequence 2798 BP; 594 A; 775 C; 809 G; 620 T; 0 U; 0 Other; 

Query Match 62.3%; Score 549; DB 4; Length 2798; 

Best Local Similarity 100.0%; Pred. No. 1.4e-124; 

Matches 549; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 333 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 392 

I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2240 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 2299



Qy 393 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 452 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2300 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 2359 



Qy 453 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 512 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2360 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 2419



Qy 513 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 572

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2420 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 2479



Qy 573 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 632

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2480 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 2539 



Qy 633 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 692

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2540 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 2599



Qy 693 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 752 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2600 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 2659



Qy 753 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 812 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 266 0 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 2 719 



Qy 813 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 8 72 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 72 0 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 2 7 79 



Qy 873 CTCTGTAAC 881 

I I I I I I I I I 
Db 2780 CTCTGTAAC 2788 



RESULT 10
AAK90423

ID AAK90423 standard; DNA; 2804 BP.
XX
AC AAK90423;
XX
DT 05-NOV-2001 (first entry)
XX
DE Human digestive system antigen genomic sequence SEQ ID NO: 3999.
XX
KW Human; digestive system antigen; gene therapy; cancer; appendicitis;
KW ulcerative colitis; infection; Hirschsprung's disease; chronic colitis;
KW digestive system disorder; Meckel's diverticulum; ds.
XX
OS Homo sapiens.
XX
PN WO200155314-A2.
XX
PD 02-AUG-2001.
XX
PF 17-JAN-2001; 2001WO-US001324.
XX
PR 31-JAN-2000; 2000US-0179065P.
PR 04-FEB-2000; 2000US-0180628P.
PR 24-FEB-2000; 2000US-0184664P.
PR 02-MAR-2000; 2000US-0186350P.
PR 16-MAR-2000; 2000US-0189874P.
PR 17-MAR-2000; 2000US-0190076P.
PR -APR-2000; 2000US-0198123P.
PR -MAY-2000; 2000US-0205515P.
PR 07-JUN-2000; 2000US-0209467P.
PR 28-JUN-2000; 2000US-0214886P.
PR 30-JUN-2000; 2000US-0215135P.
PR 07-JUL-2000; 2000US-0216647P.
PR 07-JUL-2000; 2000US-0216880P.
PR 11-JUL-2000; 2000US-0217487P.
PR 11-JUL-2000; 2000US-0217496P.
PR 14-JUL-2000; 2000US-0218290P.
PR 26-JUL-2000; 2000US-0220963P.
PR 26-JUL-2000; 2000US-0220964P.
PR 14-AUG-2000; 2000US-0224518P.
PR 14-AUG-2000; 2000US-0224519P.
PR 14-AUG-2000; 2000US-0225213P.
PR 14-AUG-2000; 2000US-0225214P.
PR 14-AUG-2000; 2000US-0225266P.
PR 14-AUG-2000; 2000US-0225267P.
PR 14-AUG-2000; 2000US-0225268P.
PR 14-AUG-2000; 2000US-0225270P.
PR 14-AUG-2000; 2000US-0225447P.
PR 14-AUG-2000; 2000US-0225757P.
PR 2000US-0239935P.
PR 2000US-0239937P.
PR 2000US-0240960P.
PR 2000US-0241221P.
PR 2000US-0241785P.
PR 2000US-0241786P.
PR 2000US-0241787P.
PR 2000US-0241808P.
PR 2000US-0241809P.
PR 2000US-0241826P.
PR 2000US-0244617P.
PR 2000US-0246475P.
PR 11- -2000;
PR 05-JAN-2001; 2001US-0259678P.

XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Barash SC, Ruben SM; 
XX 

DR WPI; 2001-502630/55. 

XX 

PT Polynucleotides encoding digestive system antigens, useful for 

PT diagnosing, treating, preventing and/or prognozing disorders of the 

PT digestive system, particularly cancer and cancer metastases.

XX 

PS Disclosure; SEQ ID NO 3999; 986pp; English. 
XX 

CC The present invention provides the protein and coding sequences of a 

CC number of human digestive system antigens. These can be used in the 

CC diagnosis, treatment and prevention of digestive system disorders, 

CC including cancer, Meckel's diverticulum, bacterial or parasitic 

CC infections, appendicitis, Hirschsprung's disease, chronic colitis or 

CC ulcerative colitis. The present sequence is a genomic DNA fragment 

CC encoding a digestive system antigen of the invention 
XX 

SQ Sequence 2804 BP; 598 A; 777 C; 808 G; 621 T; 0 U; 0 Other; 



Query Match 62.1%; Score 547.4; DB 4; Length 2804; 

Best Local Similarity 99.8%; Pred. No. 3.5e-124; 

Matches 548; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy        333 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 392
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2246 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 2305

Qy        393 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 452
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2306 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 2365

Qy        453 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 512
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2366 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 2425

Qy        513 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 572
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2426 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 2485

Qy        573 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 632
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2486 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 2545

Qy        633 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 692
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2546 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 2605

Qy        693 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 752
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||| ||
Db       2606 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTCCC 2665

Qy        753 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 812
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2666 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 2725

Qy        813 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 872
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2726 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 2785

Qy        873 CTCTGTAAC 881
              |||||||||
Db       2786 CTCTGTAAC 2794



RESULT 11 
ABL65256 

ID ABL65256 standard; DNA; 574 BP. 
XX 

AC ABL65256; 
XX 

DT 11-JUN-2007 (revised)

DT 15-MAY-2002 (first entry) 

XX 

DE Lung cancer related gene sequence SEQ ID NO: 3593. 
XX 

KW Human; cancer; colon; breast; ovary; oesophagus; kidney; thyroid; 

KW stomach; lung; prostate; pancreas; carcinoma; antitumour; cancerous; 

KW cytostatic; gene therapy; antineoplastic; Wilm's tumour; adenocarcinoma; 

KW gene; ds.

XX 

OS Homo sapiens.
XX 

PN WO200194629-A2.
XX
PF 2001WO-US010838.
XX
PR 2000US-0209473P.
PR 2000US-0209531P.
PR 2000US-0233133P.
PR 2000US-0233617P.
PR 2000US-0234009P.
PR 2000US-0234034P.
PR 2000US-0234052P.
PR 2000US-0234509P.
PR 2000US-0234567P.
PR 2000US-0234923P.
PR 2000US-0234924P.
PR 2000US-0235077P.
PR 2000US-0235082P.
PR 2000US-0235134P.
PR 2000US-0235280P.
PR 2000US-0235637P.
PR 2000US-0235638P.
PR 2000US-0235711P.
PR 2000US-0235720P.
PR 2000US-0235840P.
PR 2000US-0235863P.
PR 2000US-0236028P.
PR 2000US-0236032P.
PR 2000US-0236033P.
PR 2000US-0236034P.
PR 2000US-0236109P.
PR 2000US-0236111P.
PR 2000US-0236842P.
PR 2000US-0236891P.
PR 2000US-0237172P.
PR 2000US-0237173P.
PR 2000US-0237278P.
PR 2000US-0237294P.
PR 2000US-0237295P.
PR 2000US-0237316P.
PR 2000US-0237425P.
PR 2000US-0237598P.
PR 2000US-0237604P.
PR 2000US-0237606P.
PR 2000US-0237608P.
PR 2000US-0244867P.
PR 2000US-0245084P.
XX
PA (AVAL-) AVALON PHARM.
XX
PI Young PE, Augustus M, Carter KC, Ebner R, Endress G, Horrigan S;

PI Soppet DR, Weaver Z; 

XX 

DR WPI; 2002-188264/24. 

DR PC:NCBI; gil318992. 

XX 

PT Screening for anti-neoplastic agent involves exposing cells to a chemical

PT agent to be tested for anti-neoplastic activity, and determining a change

PT in expression of a gene of a signature gene set. 
XX 

PS Claim 1; SEQ ID NO 3593; 44pp; English. 

XX 

CC The present invention describes a method (M1) for screening for an anti-

CC neoplastic agent. The method involves exposing cells to a chemical agent

CC to be tested for anti-neoplastic activity, determining a change in

CC expression of at least one gene (I) of a signature gene set, where (I)

CC comprises a sequence (S) selected from 8447 sequences (given in ABL61664

CC to ABL70110), or is at least 95% identical to (S), where a change in

CC expression is indicative of anti-neoplastic activity. (I) has cytostatic

CC activity and can be used in gene therapy. M1 can be used for screening an

CC anti-neoplastic agent, and can be used for producing a product which is

CC the data collected with respect to the anti-neoplastic agent as a result

CC of M1, and the data is sufficient to convey the chemical structure and/or

CC properties of the agent. M1 can be used in the treatment of cancer such

CC as colon, breast, stomach, lung, thyroid, oesophageal, ovarian, kidney, 

CC prostate or pancreatic cancer, adenocarcinoma, carcinoma, clear cell 

CC cancer, infiltrating ductal cancer, infiltrating lobular cancer, squamous 

CC cell carcinoma, neuroendocrine carcinoma, papillary carcinoma and Wilm's 

CC tumour 

CC 

CC Revised record issued on 11-JUN-2007: Enhanced with precomputed

CC information from BOND. 

XX 

SQ Sequence 574 BP; 140 A; 144 C; 158 G; 128 T; 0 U; 4 Other; 



Query Match 41.7%; Score 367; DB 6; Length 574;

Best Local Similarity 88.8%; Pred. No. 5.4e-80; 

Matches 491; Conservative 0; Mismatches 42; Indels 20; Gaps 8; 

Qy        348 GACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGG 407
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          7 GACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGG 66

Qy        408 TTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCC 467
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         67 TTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCC 126

Qy        468 TGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGA 527
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        127 TGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGA 186



Qy        528 GAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAG 587
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db        187 GAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAN 246

Qy        588 GCCCCGCTGAGTGGACCGGACCTCTGACACCTCC-AGGTTCTTGCTGACTCCGGCCTGGT 646
              |||||||| ||||||||||||||||||||||||| |||||||||||||||||||||||||
Db        247 GCCCCGCTNAGTGGACCGGACCTCTGACACCTCCAAGGTTCTTGCTGACTCCGGCCTGGT 306

Qy        647 GAAAGGG-AGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGA--GAGGCTCTCTTCT-GG 702
              ||||||| ||||||||||||||||||||||||||||||||||   |||||||||||| ||
Db        307 GAAAGGGAAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAAGAAGGCTCTCTTCTNGG 366

Qy        703 ACAAACACACCCTCCCAGCCCCCAGGGCTGT GCAAACACATGCCCCTGCCATAAGCA 759

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III III I I I I I I I I 

Db        367 ACAAACACACCCTCCCAGCCCCCAGGGCTGTTGCAAACACATTGCCCCTTGCCATAAAGC 426

Qy        760 CCAACAAGAACTTCTTGCAGGTGGAGTG GCTGTTTTTTATAAGTTGTTTTA 810

III II III I I I I I I I I I I III I I I I I I I I 

Db        427 ACCAAACAAAGAACTTCTTTGCAGGGTGGAGTGGGCTGTTTTTTAATAAAGTTTGTTTTA 486

Qy        811 CAGA-TACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTT--GCATTATAAATAA 867

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db        487 CAGATTACGGAAACAGTTCAAAATGGGATTTATAATTTCTTTTTTTGCATTAATAAATAA 546

Qy        868 AGATCCTCTGTAA 880
              ||||||||||| |
Db        547 AGATCCTCTGTTA 559



RESULT 12 
ADE07493 

ID ADE07493 standard; DNA; 1617 BP.
XX 

AC ADE07493; 
XX 

DT 29-JAN-2004 (first entry) 
XX 

DE Novel coding sequence (useful for identifying genetic disorders) #559. 
XX 

KW novel gene; novel protein; tissue marker; molecular weight marker; 

KW chromosome marker; genetic disorder; gene; ds . 

XX 

OS Unidentified. 

XX 

PN WO2003054152-A2 . 
XX 

PD 03-JUL-2003.
XX
PF 10-DEC-2002; 2002WO-US039555.
XX
PR 10-DEC-2001; 2001US-0339739P.
PR 11-DEC-2001; 2001US-0339453P.
PR 14-MAR-2002; 2002US-0365091P.
PR 14-MAR-2002; 2002US-0365384P.
PR 12-APR-2002; 2002US-0372381P.
PR 12-APR-2002; 2002US-0372615P.
PR 22-APR-2002; 2002US-00128558.
PR 24-APR-2002; 2002US-0376045P.
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Asundi V, Goodrich RW, Ren F, Zhang J, Zhao QA, Wang J; 

PI Ghosh M, Xue AJ, Wehrman T, Weng G, Zhou P, Drmanac RT, Wang Z; 

PI Ma Y, Wang D, Chen R, Xu C, Boyle BJ; 
XX 

DR WPI; 2003-569235/53. 

DR P-PSDB; ADE08404. 

XX 

PT New polynucleotides, useful for expressing recombinant proteins for 

PT analysis, characterization or therapeutic use, or as markers for tissues 

PT in which the corresponding protein is preferentially expressed. 

XX 

PS Claim 1; SEQ ID NO 559; 1177pp; English. 
XX 

CC The invention comprises the amino acid and coding sequences of novel 

CC proteins. The DNA and protein sequences of the invention are useful as: 

CC markers for tissues in which the corresponding protein is preferentially 

CC expressed; as molecular weight markers on gels; as chromosome markers or 

CC tags; to identify chromosomes or to map related gene positions; and to 

CC compare with endogenous DNA sequences in patients to identify potential 

CC genetic disorders. The present DNA sequence represents a gene of the 

CC invention. 
XX 

SQ Sequence 1617 BP; 374 A; 484 C; 474 G; 285 T; 0 U; 0 Other; 

Query Match 23.4%; Score 206.4; DB 9; Length 1617; 

Best Local Similarity 99.5%; Pred. No. 2.8e-40; 

Matches 207; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy        186 GAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCT 245
              | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1410 GCAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCT 1469

Qy        246 GAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACGGGCAT 305
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1470 GAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACGGGCAT 1529

Qy        306 GATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGA 365
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1530 GATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGA 1589

Qy        366 TGATGGAGAAATCGACCCAGAAGCCTGA 393
              ||||||||||||||||||||||||||||
Db       1590 TGATGGAGAAATCGACCCAGAAGCCTGA 1617



RESULT 13 
AFS82561 

ID AFS82561 standard; DNA; 203 BP. 
XX 

AC AFS82561; 
XX 

DT 20-SEP-2007 (first entry) 
XX 

DE Human transcript sequence, SEQ ID 1960. 
XX 

KW DNA detection; RNA detection; exon; ds . 
XX 

OS Homo sapiens . 
XX 

PN WO200268579-A2 . 
XX 

PD 06-SEP-2002. 

XX 

PF 10-JAN-2002; 2002WO-US000284.
XX 

PR 10-JAN-2001; 2001US-00756696.
XX 

PA (PEKE-) PE CORP NY.
XX 

PI Venter CJ, Adams M, Li PWD, Myers EW; 
XX 

DR WPI; 2002-682812/73. 
XX 

PT New isolated nucleic acid detection reagent for detecting the presence of 

PT specified human exons.

XX 

PS Claim 4; SEQ ID NO 1960; 40pp; English. 

XX 

CC The present invention relates to a novel isolated nucleic acid detection 

CC reagent for detecting the presence of specified human exons. The exon

CC sequences cover every identified human transcript and exon comprising 

CC every gene/coding region of the human genome. The present sequence is one 

CC such exon sequence. The nucleic acid detection agent is used for 

CC detecting the presence of at least 100000, at least 2000, at least 50000

CC or at least 10000 human exons. The sequences that span exon-exon

CC junctions eliminate false signals caused by genomic contamination. This 

CC is because a detection element comprising two neighboring exons as one 

CC contiguous sequence will not hybridize to genomic DNA comprising 

CC intervening intronic DNA. These detection elements will only hybridize to 

CC expressed mRNA transcripts in which the exons are connected and the 

CC intronic sequence has been removed, therefore forming one contiguous 

CC stretch of sequence corresponding to the sequence of the detection 

CC element that spans the exon-exon junction. 

XX 

SQ Sequence 203 BP; 49 A; 65 C; 60 G; 29 T; 0 U; 0 Other; 



Query Match 23.0%; Score 203; DB 7; Length 203;

Best Local Similarity 100.0%; Pred. No. 9.2e-40; 

Matches 203; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy        191 GCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACG 250
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACG 60

Qy        251 ACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACGGGCATGATGC 310
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 ACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACGGGCATGATGC 120

Qy        311 TGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATG 370
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 TGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATG 180

Qy        371 GAGAAATCGACCCAGAAGCCTGA 393
              |||||||||||||||||||||||
Db        181 GAGAAATCGACCCAGAAGCCTGA 203



RESULT 14 
ACH77410 

ID ACH77410 standard; DNA; 550 BP. 
XX 

AC ACH77410; 
XX 

DT 29-JUL-2004 (first entry) 
XX 

DE Human genome derived single exon probe #10605. 
XX 

KW Human; probe; ss; gene expression; single exon probe; microarray; 

KW alternative splicing event; genomic alteration. 

XX 

OS Homo sapiens . 
XX 

PN US2003194704-A1.
XX

PD 16-OCT-2003. 
XX 

PF 03-APR-2002; 2002US-00029386.
XX 

PR 03-APR-2002; 2002US-00029386.
XX 

PA (PENN/) PENN S G.

PA (RANK/) RANK D R. 

PA (HANZ/) HANZEL D K. 
XX 

PI Penn SG, Rank DR, Hanzel DK; 
XX 

DR WPI; 2004-119264/12. 
XX 

PT New human genome-derived single exon nucleic acid probes useful for human 

PT gene expression analysis, for identifying or characterizing alternative 

PT splicing events, for assessing genomic alterations or as tools for 

PT surveying tissues. 
XX 

PS Claim 15; SEQ ID NO 10605; 80pp; English. 

XX 

CC The invention relates to a nucleic acid probe for measuring human gene 

CC expression, comprising any of the 27,400 fully defined nucleotide

CC sequences in the specification, or their complements or fragments, and 

CC encoding at least 8 amino acids of any of the 6888 amino acid sequences 

CC fully defined in the specification. The probe is a single exon probe that 

CC hybridises under high stringency conditions to a nucleic acid molecule 

CC expressed in human cells or tissues. Also included are a spatially- 

CC addressable set of single exon nucleic acid probes for measuring human 

CC gene expression (comprising a plurality of single exon nucleic acid 

CC probes cited above, where each of the plurality of probes is separately 

CC and addressably isolatable or amplifiable from the plurality), a single

CC exon microarray for measuring human gene expression, a method of 

CC measuring human gene expression, a vector comprising the single exon 

CC probe cited above, an ORF-encoded peptide comprising at least 8 

CC contiguous amino acids of any of the above-mentioned amino acid

CC sequences (optionally with conservative amino acid substitutions), an 

CC isolated antibody that binds specifically to a peptide cited above, 

CC methods of selling and/or licensing single exon probes or microarrays to 

CC a customer desiring to measure gene expression, a method of providing 

CC human gene expression data by subscription, and a computer-readable 

CC storage medium which contains a database having a plurality of records 

CC (each record including data on the expression of a single exon probe 

CC cited above. The probe, methods and apparatus are useful in gene 

CC expression analysis. The probes may be used as tools for surveying 

CC tissues to detect the presence of expressed messages that contain their 

CC specific exon, or in constructing genome-derived single exon microarrays. 

CC In addition, the probes are used in identifying and characterising

CC alternative splicing events, in detecting and characterising gross

CC alterations in the genomic locus that includes their exon, in assessing 

CC smaller genomic alterations, in priming the synthesis of nucleic acids, 

CC or in expressing the ORF-encoded peptide. The present sequence is a human 

CC single exon probe of the invention. Note: The sequence data for this 

CC patent did not form part of the printed specification, but was obtained 

CC in electronic format directly from USPTO at 

CC seqdata.uspto.gov/sequence.html?DocID=20030194704

XX 

SQ Sequence 550 BP; 92 A; 186 C; 174 G; 98 T; 0 U; 0 Other; 



Query Match 16.7%; Score 147.2; DB 12; Length 550; 

Best Local Similarity 95.0%; Pred. No. 7.2e-26; 

Matches 152; Conservative 0; Mismatches 8; Indels 0; Gaps 0; 

Qy        175 ATGTCCTGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAG 234
              | | |  || |   ||||||||||||||||||||||||||||||||||||||||||||||
Db         86 ACGCCTGGCTTCTCAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAG 145

Qy        235 GCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACC 294
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        146 GCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACC 205

Qy        295 CCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGCACAG 334
              ||||||||||||||||||||||||||||||||||||||||
Db        206 CCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGCACAG 245



RESULT 15 
ACH91110 

ID ACH91110 standard; DNA; 170 BP. 
XX 

AC ACH91110; 
XX 

DT 29-JUL-2004 (first entry) 
XX 

DE Human genome derived single exon probe #24305. 
XX 

KW Human; probe; ss; gene expression; single exon probe; microarray; 

KW alternative splicing event; genomic alteration. 

XX 

OS Homo sapiens . 
XX 

PN US2003194704-A1 . 
XX 

PD 16-OCT-2003. 

XX 

PF 03-APR-2002; 2002US-00029386.
XX

PR 03-APR-2002; 2002US-00029386.
XX 

PA (PENN/) PENN S G. 

PA (RANK/) RANK D R. 

PA (HANZ/) HANZEL D K. 
XX 

PI Penn SG, Rank DR, Hanzel DK; 
XX 

DR WPI; 2004-119264/12. 
XX 

PT New human genome-derived single exon nucleic acid probes useful for human 

PT gene expression analysis, for identifying or characterizing alternative 

PT splicing events, for assessing genomic alterations or as tools for 

PT surveying tissues.
XX 

PS Claim 1; SEQ ID NO 24305; 80pp; English. 

XX 

CC The invention relates to a nucleic acid probe for measuring human gene 

CC expression, comprising any of the 27,400 fully defined nucleotide

CC sequences in the specification, or their complements or fragments, and 

CC encoding at least 8 amino acids of any of the 6888 amino acid sequences 

CC fully defined in the specification. The probe is a single exon probe that 

CC hybridises under high stringency conditions to a nucleic acid molecule 

CC expressed in human cells or tissues. Also included are a spatially- 

CC addressable set of single exon nucleic acid probes for measuring human 

CC gene expression (comprising a plurality of single exon nucleic acid 

CC probes cited above, where each of the plurality of probes is separately 

CC and addressably isolatable or amplifiable from the plurality), a single

CC exon microarray for measuring human gene expression, a method of 

CC measuring human gene expression, a vector comprising the single exon 

CC probe cited above, an ORF-encoded peptide comprising at least 8 

CC contiguous amino acids of any of the above-mentioned amino acid

CC sequences (optionally with conservative amino acid substitutions), an 

CC isolated antibody that binds specifically to a peptide cited above, 

CC methods of selling and/or licensing single exon probes or microarrays to 

CC a customer desiring to measure gene expression, a method of providing 

CC human gene expression data by subscription, and a computer-readable 

CC storage medium which contains a database having a plurality of records 

CC (each record including data on the expression of a single exon probe 

CC cited above. The probe, methods and apparatus are useful in gene 

CC expression analysis. The probes may be used as tools for surveying 

CC tissues to detect the presence of expressed messages that contain their 

CC specific exon, or in constructing genome-derived single exon microarrays. 

CC In addition, the probes are used in identifying and characterising 

CC alternative splicing events, in detecting and characterising gross 

CC alterations in the genomic locus that includes their exon, in assessing 

CC smaller genomic alterations, in priming the synthesis of nucleic acids, 

CC or in expressing the ORF-encoded peptide. The present sequence is a human 

CC single exon probe of the invention. Note: The sequence data for this

CC patent did not form part of the printed specification, but was obtained

CC in electronic format directly from USPTO at 

CC seqdata.uspto.gov/sequence.html?DocID=20030194704

XX 

SQ Sequence 170 BP; 37 A; 64 C; 47 G; 22 T; 0 U; 0 Other; 

Query Match 16.3%; Score 144; DB 12; Length 170; 

Best Local Similarity 100.0%; Pred. No. 3e-25; 

Matches 144; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy        191 GCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACG 250
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 GCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACG 60

Qy        251 ACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACGGGCATGATGC 310
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 ACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACGGGCATGATGC 120

Qy        311 TGCCGAGGGACTTGGGGAGCACAG 334
              ||||||||||||||||||||||||
Db        121 TGCCGAGGGACTTGGGGAGCACAG 144